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(57) Abstract 

A network transmitter schedules packets so that 
packets are transmitted to a host or group of hosts so 
as not to overload any particular part of the network. 
In an embodiment, the transmitter uses packet data 
structures with a schedule indication for packets 
placed in the queue (146a-c) so that an independently 
running adaptor (160) may know when to remove 
packets from the queue (146a-c) and transmit them. 
In an alternative embodiment, packets are scheduled 
by setting a future interrupt for transmitting a packet 
or group of packets. In a further embodiment, packets 
are placed in temporal sets where a temporal set is a 
group of packets that can be transmitted in succession 
without violating the bandwidth limitations of any 
network segment. 
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METHOD AND APPARATUS FOR 
TIME-BASED DOWNLOAD CONTROL 

BACKGROUND OF THE INVENTION 
This application claims priority from provisional 
patent application serial number 60/032,124, filed December 5, 
1996, which discussed a number of background concepts related 
to the invention* 

The current invention relates to the field of elec- 
tronic circuits. More particularly, the current invention 
relates to improvements in networked computer environments and 
has particular applications to the transmission of information 
between digital devices over a communications medium. A wide 
variety of types of computer systems and networks exists, each 
having variations in particular implementations. The present 
invention will be described with reference to particular types 
of systems for clarity, but this should not be taken to limit 
the invention. It will be apparent to those of skill in the 
art that the invention has applications in many different 
types of computer and network systems. The invention there- 
fore should not be seen as limited except as specifically 
provided in the attached claims. 

Digital computer networks have become ubiquitous in 
academic, industry, and office environments. A number of dif- 
ferent aspects of computer networks are discussed in co- 
assigned pending U.S. applications serial nos. 08/313,674; 
08/542,157; 08/506,533; and 08/329 , 714 each of which are 
incorporated herein by reference to the extent necessary to 
understand the invention. 

This specification presumes familiarity with the 
general concepts, protocols, and devices currently used in LAN 
networking and WAN internetworking applications such as, for 
example, the IEEE 802 and ISO 8802 protocol suites and other 
series of documents released by the Internet Engineering Task 
Force. Many examples of such protocols are publicly available 
and are discussed in more detail in the above-referenced 
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patent applications and therefore will not be fully discussed 
here. 

Fia. 1 

Fig. 1 illustrates a local area network (LAN) 40 of 
a type that might be used today in a moderate-sized office or 
academic environment as an example of a type of network in 
which the present invention may be employed. LANs are 
arrangements of various hardware and software elements that 
operate together to allow a number of digital devices to ex- 
change data within the LAN and also may include internet 
connections to external wide area networks (WANs) such as WANs 
82 and 84. Typical modern LANs such as 40 are comprised of 
one to many LAN intermediate systems (ISs) such as ISs 60-62 
and 67 that are responsible for data transmission throughout 
the LAN and a number of end systems (ESs) such as ESs 50a-d, 
51a-c, and 52a-g, that represent the end user equipment. The 
ESs may be familiar end-user data processing equipment such as 
personal computers, workstations, and printers and 
additionally may be digital devices such as digital telephones 
or real-time video displays. Different types of ESs can 
operate together on the same LAN. In one type of LAN, LAN ISs 
60-61 are referred to as bridges and WAN ISs 64 and 66 are 
referred to as routers, and IS 67 is referred to as a 
repeater, however many different LAN configurations are possi- 
ble, and the invention is not limited in application to the 
network shown in Fig. 1. 

The LAN shown in Fig. 1 has segments 70a-e, 71a-e, 
and 72a-e, and 73a. A segment is generally a single intercon- 
nected medium, such as a length of contiguous wire, optical 
fiber, or coaxial cable or a particular frequency band. A 
segment may connect just two devices, such as segment 7 0a, or 
a segment such as 72d may connect a number of devices using a 
carrier sense multiple access/collision detect (CSMA/CD) 
protocol or other multiple access protocol such as a token bus 
or token ring. A signal transmitted on a single segment, such 
as 72d, is generally simultaneously heard by all of the ESs 
and ISs connected to that segment. 
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Drivers and Adaptors 

The ISs and ESs in Fig. 1 generally each includes 
one or more adaptors and a set of drivers. An adaptor general- 
ly includes circuitry and connectors for communication over a 
segment and translates data from the digital form used by the 
computer circuitry in the IS or ES into a form that may be 
transmitted over the segment, e.g. , electrical signals, opti- 
cal signals, radio waves, etc. An ES such as 50b will gener- 
ally have one adaptor for connecting to its single segment. A 
LAN IS such as 61 will have five adaptors, one for each seg- 
ment to which it is connected. 

A driver is a set of instructions resident on a 
device that allows the device to accomplish various tasks as 
defined by different network protocols. Drivers are generally 
software programs stored on the ISs or ESs in a manner that 
allows the drivers to be modified without necessarily 
modifying the IS or ES hardware. 

Adaptor to Host Interface 

Another aspect of networks is the interface between 
the network, the adaptor, and the host operating system. Some 
types of network protocols may require a large amount of 
attention from a host processor. This can be undesirable 
where a host's activity on the network impinges on the host 
processor's ability to perform other host functions. Adaptors 
may also differ in their ability to buffer network traffic. 
Some adaptors rely on the host to buffer most network traffic 
and do not include a large amount of buffer memory on the 
adaptor itself. 

Need For Improved Transmission Scheduling 

In recent years, the amount and type of data that 
users wish to transmit over a network has increased 
dramatically. This increase is not only in the total amount 
of data transmitted, but also in the number of different types 
of data streams that might be carried on the same network. 
Increasingly, users desire a LAN such as that shown in Fig, 1, 
to carry digital data, such as electronic messages or program 
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and data files, real-time audio signals, and real-time video 
signals, all over the same network. These different data 
streams may have widely different requirements in terms of 
total bandwidth and sensitivity to latency and jitter. 

What is needed is an improved network component that 
can effectively operate on a network carrying different types 
of data traffic with different operating speeds and prevent 
slower components in a network from degrading the transmission 
performance of other components* 

SUMMARY OF THE INVENTION 
The present invention comprises techniques and 
devices for improved computer network adaptors and drivers 
that schedule packets to be transmitted so as to optimize 
network performance. In some types of prior art adaptors, 
packets of data are assembled by the host processor and placed 
in a queue in a shared system memory. Once these packets are 
complete, the adaptor generally will, when it is ready, begin 
to download queued packets from shared system memory and 
transmit them over the network when its network connection is 
available. In many prior art adaptors, packets are downloaded 
by the adaptor and transmitted over the network in first- 
in/first-out (FIFO) order, with the first packet placed in 
system memory by the host processor being the first 
transmitted by the adaptor. While the invention has 
advantages in adaptors that transmit packets in FIFO order, it 
may also be used in adaptors that order packets based on 
destinations in order to spread packets, such as described in 

coassigned patent application S/N . 

In general, in prior art networks of the kind des- 
cribed herein, particularly in ethernet-type networks, a 
transmitting host and transmitting adaptor are unaware of the 
speed at which the destination host or the network connecting 
the source to the destination can receive packets. Packets 
are transmitted by the adaptor as soon as the network is 
available. While prior adaptors may observe rules regarding 
prioritizing packets and may defer transmission of lower 
priority packets for higher priority packets, the adaptor does 
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not control the rate at which it transmits packets based on 
the rate that the network or receiver can receive them. 

In the present invention, the host processor, or, 
more precisely, a scheduling process running on the host pro- 
cessor or possibly and alternatively on an adaptor processor, 
attempts to schedule transmitted packets so as not to transmit 
packets faster than a receiver can receive them or, in another 
embodiment, faster than the network (i.e. intermediate 
systems) can deliver the packets. 

According to the invention, a scheduler is provided 
with information regarding bandwidth capabilities of network 
destinations or of intermediate paths in the network through 
which the packets will flow. This information may be provided 
in a variety of ways, including but not limited to: a protocol 
of end-system bandwidth notification as described in 

coassigned, copending application S/N , wherein 

end-systems may communicate with each other and the network 
information regarding their bandwidth capabilities; entry of 
network topology and bandwidth information by a network 
administrator; a mechanism for timing the round-trip or one- 
way delay of a packet delivered to a destination, or any other 
means. The particular mechanism by which the bandwidth 
capabilities of the destination or network is known to the 
transmitter is not a part of the present invention. 

The scheduler, knowing the bandwidth of a particular 
destination, determines a schedule for transmitting packets so 
that a destination does not receive packets faster than it can 
handle those packets. The scheduler may do this according to 
a variety of algorithms. These various scheduler algorithms 
may take into account a number of different network 
characteristics, including but not limited to the buffer 
capacity of various intermediate systems (which may be thought 
of as network elasticity) , alert response latency of a 
particular adaptor or operating system, minimum jitter 
requirements, or any other inherent or desired characteristic 
of network transmitter performance. 

The scheduler controls transmission of packets by 
providing for packet transmission alerts to the adaptor to 
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transmit the packets. Two specific alternative mechanisms for 
generating alerts are described in detail below (interrupt- 
based download control and descriptor-based download control) , 
but other mechanism for generating alerts are possible within 
the invention . 

In interrupt-based download control, the scheduler 
sets (or programs) a future interrupt for a to-be-transmitted 
packet based on the schedule for that destination. When the 
interrupt occurs, the driver instructs the adaptor to transmit 
the scheduled packet. 

In descriptor-based download control, the scheduler 
attaches a scheduling indication to packets placed in system 
memory, and the adaptor uses that indication to determine when 
to download that packet from system memory and transmit the 
packet on the network. 

According to the invention, other techniques, 
including grouping packets into temporal sets, may be used by 
the scheduler to control the flow of transmitted packets to 
optimize network performance. Temporal sets are groups of 
packets that the scheduler determines can be transmitted in 
back-to-back fashion without violating the target bandwidth of 
any destination. When packets are grouped into temporal sets, 
the scheduler may set just one alert for a group of packets. 

Specific aspects of the invention will be better 
understood upon reference to the following description of spe- 
cific embodiments and the attached claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a diagram of a moderate sized network of 
one type in which the invention may be effectively employed; 

FIG. 2A is a diagram illustrating a simplified 
network consisting of a transmitter transmitting to three 
receivers to illustrate aspects of the invention; 

FIG. 2B is an alternative diagram illustrating the 
network of FIG 2A using one particular type of switching 
device to implement the network. 
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FIG. 3 A is a block diagram of an end system having 
system memory and an adaptor using descriptor based download 
control according to one embodiment of the invention. 

FIG. 3B is a block diagram of an end system having 
system memory and an adaptor using interrupt based download 
control according to one embodiment of the invention. 

FIGs. 4A and 4B illustrate adding a packet to a 
temporal set according to an embodiment of the invention. 

FIG. 5 illustrates one example of a packet data 
structure (stored in system memory) according to an embodiment 
of the invention. 

FIG. 6 illustrates the format of a ScheduleTime 
value according to the invention. 

FIG. 7 is a block diagram of the download and 
scheduling logic used in the adaptor shown in FIG. 3C. 

FIG. 8 illustrates the state sequence for the 
schedule time polling of the circuit illustrated in FIG. 6. 

FIG. 9 illustrates the dnPollCnt counter according 
to a specific embodiment of the invention. 

FIG. 10 shows the real time count register according 
to one embodiment of the invention. 

FIG. 11 illustrates the dnPollCnt register according 
to an embodiment of the invention. 

DETAILED DESCRIPTION OF SPECIFIC EMBODIMENTS 

The invention consists of a number of functional 
units which work together to effect methods of the invention. 
It will be apparent to those of skill in the art and from the 
description below that some of these functional units can be 
employed independently to accomplish aspects of the invention. 
It will also be apparent that various alternative components 
described below can be combined to construct a system 
according to the invention. 



Packet Scheduling to Control Transmitter Bandwidth 

FIGS. 2A and 2B are alternative simplified diagrams 
of a network showing a transmitter T connected to three 
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receivers A, B, and C as an example of a network connection in 
order to illustrate aspects of the invention. 

FIGS 3A and 3B are each diagrams of an ES such as 
500b with an adaptor according to alternative embodiments of 
the invention. FIGS. 3 A and 3B both show a host 110 having a 
CPU process 115, system memory 14 0, and adaptor 160. Within 
host 110's processor and address space are higher layer 
network protocols 120, communicating with adaptor 160 through 
driver 125. Driver 125 operates in conjunction with a 
scheduler 130. According to the invention, scheduler 130 
determines a schedule for packets to be transmitted based on a 
target bandwidth for a destination or group of destinations. 

Where FIGS 3A and 3B differ is the manner in which 
scheduler 130 sets alerts to control packet transmission in 
order to schedule packets to meet the target transmission 
bandwidth, as discussed in the section below. 

Scheduler 13 0 determines a time that the next packet 
for a given destination should be gueued on the adaptor for 
transmission by computing a current time (i.e. the download 
start time for the present packet to that destination) plus 
the time it will take to transmit the present packet on the 
network at the target bandwidth rate known to the scheduler, 
and then possibly adjusted by values that represent the local 
transmitter connection transmit time or possibly other local 
or network intermediate system delay times. 

As described below, the invention may include a 
factor that causes packets to be transmitted from the 
transmitter slightly sooner than the scheduled time for the 
next packet, relying on elasticity (i.e. buffering capacity) 
in network intermediate systems to not deliver packets to the 
ultimate destination before the destination can receive them. 

The target destination bandwidth rate may be set 
according to the actual bandwidth that can be handled by the 
destination, or may be set for other reasons, such as the 
control of different types of traffic or the carrying of 
traffic based on charges levied for access of the network. 
While the invention is designed for use in networks with 
variable-width or with fixed-width packets, the invention has 



WO 98/27696 9 PCT7US97/22151 

particular application when large amounts of data are being 
transmitted to a destination, and in this case, packets will 
often be of a maximum (and therefore fixed) size. Therefore, 
packets of the same size are assumed in some of the following 
examples. 

The operation of a scheduler such as 13 0 according 
to the invention will now be explained with reference to the 
example connections shown in FIGS. 2A-B. For purposes of this 
discussion, assume that T is a 100 Mbps device on a 100 Mbps 
link transmitting packets to destinations B and C, each having 
a target bandwidth of 10 Mbps, and to destination A having a 
target bandwidth of 3 0 Mbps. It will be obvious that many 
other target destinations are possible. If each packet takes 
12 0 microseconds (/xs) to transmit over the network at 100 
Mbps, and therefore 1200 microseconds at 10 Mbps, according to 
the invention, if a packet is scheduled for destination B at 
time 0, the next packet for destination B should be queued by 
scheduler 130 for download at 0 + 1200 or 1200 /xs. In a 
different embodiment, the transmit time may be adjusted by a 
value to take into account various latencies that may be 
associated with the network or to take advantage of network 
elasticity. For example, a value such as 120. ^s might be 
subtracted from the queue time for the next packet (perhaps 
representing the local transmitter time from T to the network) 
in which case the transmit time for the next packet would be 0 
+ 1200 - 120, or 1080, microseconds. This would cause the 
transmitter to slightly "lead" the target bandwidth, but with 
the assumption that the extra bandwidth can be absorbed by 
buffering capacity in network intermediate and end system 
adaptors . 

Further, the above formula assumes that the adaptor 
can transmit a packet as soon as it downloads the first byte 
from host memory, as described in coassigned patent 5,412,782. 
If the adaptor does not have that capability, the scheduling 
is adjusted by adding the delay time it takes to download the 
packet from the host to the adaptor to the 12 0 value 
subtracted from the packet transmit time. 
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The third packet: for destination B is scheduled for 
download at 1200 + 1200 or 2400 microseconds. The fourth is 
scheduled for 1200 + 1200 + 1200 or 3600 microseconds. In 
each of these cases, the schedule could also be adjusted 
either to cause leading in the network or to take into account 
latency as described above. A packet schedule is similarly 
determined for other destinations for which the scheduler has 
received packets. 

Temporal Sets Simplify Controlling Transmission of Packets 

According to a further embodiment of the invention, 
a scheduler can schedule a number of packets to one, or to a 
variety of destinations, together in a temporal set. A 
temporal set is composed of a set of packets which can be 
transmitted in a back-to-back fashion by the adaptor on its 
network link without violating the maximum target bandwidth of 
any destinations. 

Temporal sets are defined by scheduler 13 0 according 
to a number of alternative methodologies, depending on the 
requirements of the particular application. 

In one embodiment, for each destination, or group of 
linked destinations, scheduler 13 0 determines a desired 
transmit time for each packet as described above for 
destination B such that packets to that destination will be 
transmitted near the target bandwidth of that destination. 
Scheduler 130 then places as many packets as possible in a 
temporal set, essentially filling up the set with packets so 
long as there is a packet available for some destination that 
could be transmitted immediately after the previous packet 
(i.e. transmitted back-to-back) without violating the 
bandwidth of any destination. When there are not more packets 
ready to be queued that meet that rule, scheduler 13 0 closes 
the temporal set and develops an alert for a future time 
determined by the next packet that needs to be delivered to a 
destination. Scheduler 130 places packets into their 
appropriate temporal sets and adjust sets as described below. 
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Placing a new packet in a temporal set 

A temporal set is characterized by a temporal set 

download start time (Set_TxQueue__Time) and an estimated set 

transmission end time (Set_JTx_JEnd) for packets currently in 

the temporal set. According to one embodiment of the 

invention, a packet is placed in a temporal set by scheduler 

130 when that packet 1 s desired download start time 

(PackBt_TxQueue__Tlme) is greater than or equal to 

Set_TxQueuB_Time and is less than or equal to Set_Tx__End plus 

the packet transmission time (Packet_Tx_Time) . Furthermore, 

according to one embodiment, the set's start time 

(SBtJTxQueue_T±me) and S&t_Tx_End are adjusted by a value 

determined by the start time and transmission time of the 

packet. One embodiment is represented by the code below: 

If Packet_TxQueuejrime + Packet_Tx_Time < Set_Tx_End 

SetJTxQueueJTime = SBt__TxQueue_T±me - Pack&tJTx_TimB 

El SB 

Diff_TimB = PackBt_TxQuBUB_T±mB + PackBt_Tx_TimB - 
Set_Tx_End 

Set_Tx__End = PackBtJFxQuBUBjTimB + PackBt__Tx TIjub 

S b t TxQub u e TiuiB = SBt__TxQuBUBjTimB - Dlff tTiub 

End ~~ 

FIG. 4 illustrates the mechanism described above for 
scheduling a new packet received by scheduler 13 0 once a 
number of temporal sets have been defined for packets to be 
transmitted, according to one embodiment of the invention. In 
FIG. 4, assume that a packet Dl is presented to scheduler 13 0 
after sets 1 and 2 have been defined as shown. According to 
one embodiment, Dl is placed in temporal set 1 if Dl 1 s 
PackBtJTxQuBUBjTimB (or PackBt_Download_TimB) is greater than 

or equal to Set l's S e t TxQu e u e Ti m e (or SBt^DownloadJTimB) 

and less than or equal to the SBt_Tx_End_TimB plus Dl's 
PackBt_Tx_Durat±on . In other words, Dl is added to the set if 
its scheduled PackBtJDownload_TlmB falls within the window as 
shown in FIG. 4. 

According to a further specific embodiment, a 
temporal set's SBtJDownloadJTimB may be adjusted when a new 
packet needs to be added to a temporal set. This may result 
in some packets in the temporal set being transmitted slightly 
ahead of their desired scheduled time, but this in general is 
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not. a problem in real networks because the intermediate 
devices such as 67 in the networks provide some elasticity (or 
intermediate buffering capacity) allowing them to accept a 
packet on one port even if they are not yet ready to transmit 
it out of another port. The following code provides one 
example of adding Dl to the end of a set taking advantage of 
this elasticity, with the cases as illustrated in FIG . 4: 

Case A: If Packet^Download^Tlme + Packet^Tx^puratlon < Set Tx End Time 

Set^DownloadJTlme = Set__Download_JFlme - Packet _Tx_Duratlon 
Case B: Else 

Overlap JTlme = SetJTx__EndJTlme - Packet JDownloadJTlme 
Set__Tx_End_Tlme = Packet__DownloadJTlme + Packet_Tx_Duratlon 
Set^Download_Tlme = Set_Download__Tlme - Overlap Time 
End If 

It will be seen that in Case A, the new packet fits 
entirely within existing Set_Tx_T±me and scheduler 13 0, 
according to the invention, adjusts the set's download time 
earlier and adds the new packet to the end of the set. In 
Case B, a new packet overlays the end of existing Set_Tx__Tlme , 
but still meets criteria for being added to the set. 
Scheduler 130 in this case will adjust the set's download time 
earlier by the amount of non-overlap, and add the new packet 
to end . 

It will be apparent to those of skill in the art 
that other formulas for adjusting the set's start time are 
possible and may be adjusted depending on the characteristics 
of the particular network. 

As a further example, assume destinations A, B, and 
C, all connected via a network to a server/ transmitter T, as 
shown in Fig. 2. A is connected directly to a 100 megabit 
network, but can only receive packets at 3 0 mbps. B and C are 
10 megabit systems connected via a buffering switch to the 
same 100 megabit network. Assume the jitter estimate is 50 
microseconds. Assume the following packets are submitted for 
transmission to a scheduler according to the invention at 
transmitter T with packets Al being the first packet submitted 
with a destination A, packet A2 being the second packet for A, 
etc: Al, A2, A3, A4 , Bl, B2 , B3 , CI, C2 , C3 . For simplicity, 
assume each packet takes 12 0 microseconds to transmit at 100 
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10 



15 



20 



25 



30 



35 



40 



45 



Mbps, 400 microseconds at 30 Mbps, and 1200 microseconds at 10 
Mbps. With all of this information known to scheduler 13 0 at 
transmitter T, scheduler 130 would take the actions shown in 
Table 1. 

Scheduler 130 would first determine a desired 
transmission schedule for the three destinations as follows: 
Al = 0, A2 = 400, A3 = 800, A4 - 1200; 
Bl = 0, B2 = 1200, B3 = 2400; 
CI = 0, C2 = 1200, C3 = 2400. 

_ Table 1 ~~ 



Real-time 
Count 



Packet 

Download 

Time 



Assumed 
Local Tx 
complete 



Programmed Alert 


-50 


NA 


NA 


Temporal Set 1 








Alert 


0 


NA 


NA 


Queue Al 


0 


0 


120 fJB 


Queue Bl 


0 


0 


240 fJB 


Queue A2 


0 


400 


360 fjB 


Queue CI 


0 


0 


480 jis 


Program Alert 


0 




for time 630 fjB 








Temporal Set 2 








Alert 


630 /js 


NA 


NA 


Queue A3 


680 fjB 


800 


800 fJB 


Queue B2 


680 /js 


1200 


920 fjs 


Queue C2 


680 /jb 


1200 


1040 fJB 


Queue A4 


680 fjB 


1200 


1160 fJB 


Program Alert 


680 ^8 


NA 


NA 


for 2110 fjB 






Temporal Set 3 








Alert 


2230 


NA 


NA 


Queue B3 


2280 hb 


2400 


2280 fJB 


Queue C3 


2280 ps 


2400 


2400 fJB 



(Alert latency is 50 ^/s) 
(Alert latency is 50 ^jb) 

(680 = 800 - 120 - 50) 

(Alert latency is 50 /js) 
(800 >= 1200 - 360 - 50) 

(2230 = 2400 - 120 - 50) 
(Alert latency is 50 /js) 



50 



Note that in this particular embodiment the 
scheduler does not take into account the actual download time 
or transmission time of a packet for a given destination when 
scheduling subsequent packets for that destination. Though 
packet CI, for example, is not completely locally transmitted 
into the network until time 480 ixs , scheduler 13 0, in this 
embodiment, schedules packet C2 as though it should ideally be 
transmitted at 1200 jxs. In this embodiment, scheduler 13 0 is 
primarily concerned with controlling bandwidth to a 
transmitter, and assumes that once the schedule for a given 
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destination is set to control the bandwidth to that 
destination, elasticity within the network can handle the 
leading of some packets. This assumption both simplifies the 
design of scheduler 13 0 and leads to improved network 
5 throughput* Other embodiments of the invention of more 

complex design are possible where scheduler 13 0 accounts for 
the delays caused to one destination by congestion from 
another destination in the scheduling of packets. However, in 
many situations the embodiment just described will provide 
10 superior network throughput and will be more simple to 
implement. 

In a further embodiment , an adaptor driver residing 
in a host may perform set gluing, whereby when a set's start 
time or end time are adjusted so that it overlaps with an 
15 adjacent set, temporal sets are combined so that the host 
receives just one alert to initiate transmission of the 
packets . 

Mechanisms for alerting the host to send scheduled packets 
20 Once the scheduler 130 has determined a schedule for 

packets in order to control transmission bandwidth, according 
to further alternative embodiments of the invention, scheduler 
13 0 employs various mechanisms to set some future time at 
which the adaptor will be instructed to transmit a packet or a 
25 group of packets. The time-based downloading just described 
may be used with either of the below described alert 
mechanisms or with other alert mechanisms. 

Descriptor Based Alerts Scheduling 

30 FIG. 3A illustrates a scheduler in an adaptor 

employing descriptor based scheduling. FIG. 3B illustrates an 
alternative embodiment of a scheduler in an adaptor employing 
programmed interrupt alerts to schedule packets and will be 
described in more detail below. According to this embodiment, 

35 scheduler 130 uses bits placed in packet descriptors to 

specify a time at which packet download will begin, in either 
absolute or relative terms. An adaptor according to the 
invention examines the scheduling indication in a packet 
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descriptor and uses that indication to determine when packets 
should be downloaded from system memory and transmitted on the 
network. According to this aspect of the invention, each 
packet descriptor may carry a scheduling indication, or 
5 packets may be grouped in temporal sets, with the indication 
for the first packet in the group being the alert to the 
adaptor to then transmit the set. 

In one embodiment, adaptor 160 reads schedule time 
144a from the data structure 142a and compares that to a real- 

10 time counter 162. If real-time counter 162 has a lower value 
than the schedule time, the adaptor waits and does not 
transmit the packet on the network. If the real-time counter 
has a higher value than the schedule time, the adaptor down- 
loads the packet for transmission. In one embodiment, a 

15 packet is stored in one or more packet data buffers I46a-c in 
system memory 14 0. 

FIG. 5 illustrates one example of a packet 
descriptor data structure 14 2 (stored in system memory) 
including schedule indications according to a specific 

2 0 hardware embodiment of the invention. In some embodiments, 

this descriptor will be stored in a different place than the 
packet data, and the descriptor will be separately passed to 
the adaptor before the adaptor downloads the data from system 
memory for transmission. The labels in the figure indicate 
25 the following: 

DnNextPtr points to the next data structure of this 
kind, allowing a chained list of packet data structures, which 
can be used for indicating the next packet in a temporal set; 

ScheduleTime contains information fields controlling 

3 0 when this packet is to be downloaded; 

Frames tartHeader contains control information for 
the packet; 

Each DnFracrAddr/DnFragLen pair describes a data 
buffer in system memory (such as 146a and 146b) . The buffers 
35 are downloaded (moved from system memory to the adaptor) one 
by one; together the buffers comprise the packet to be 
transmitted. 
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FIG. 6 defines the format of ScheduleTime as stored 
with a packet descriptor in system memory according to one 
embodiment of the invention. In this embodiment, an initial 
value is loaded into counter 162 when a scheduler first begins 
5 scheduling a stream of packets. This value initiates the 

RealTimeCnt and in many cases can be thought of as zero. The 
labels in the figure indicate the following: 

loadTimeCnt [bit 28]: This bit, when set, instructs 
the adaptor to load the value in the scheduleTime field into 
10 the RealTimeCnt register, and download the packet immediately. 
When loadTimeCnt is set, scheduleTimeValid is ignored by the 
adaptor. 

scheduleTime [bits 23:0]: Provides either a count 
value time at which to download this packet, or a value to be 
15 loaded into RealTimeCnt to initiate counter 162, or is 
ignored, depending upon the values of load TimeCnt and 
scheduleTimeValid . 

When scheduleTimeValid is set, scheduleTime 
represents the time at which the packet is to be downloaded. 

2 0 When the adaptor sees scheduleTimeValid set, circuits therein 

compare scheduleTime against the value in the RealTimeCnt 
counter register. If RealTimeCnt is less, then the adaptor 
goes into a polling mode, in which it periodically re-fetches 
the ScheduleTime word and compares it against RealTimeCnt . 
25 The poll rate is determined by the value in DnPoll . 

ScheduleTime specifies a time in increments of 800 ns. 

scheduleTimeValid [bit 29]: When set, indicates that 
scheduleTime contains a value that is to be compared to the 
RealTimeCnt. When not set, indicates that the packet is to be 

3 0 transmitted under control of a preceding packets scheduleTime , 

i.e. it indicates that the packet is part of a temporal set 
whose start time is located in an earlier packet's descriptor. 

FIG. 7 is a block diagram of the operation of 
download and scheduling logic 161 and FIG. 8 illustrates the 
35 state sequence for testing schedule time and for polling. As 
shown in FIG. 7, a packet is downloaded from system memory to 
the adaptor when one of these three conditions is true, as 
determined by comparison logic 166 and conveyed to state 
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machine 174 via the download packet signal: (1) the packet 
data structure contains an initial value to be loaded into 
RealTimeCnt f loadTimeCnt = 1) ; (2) the packet data structure, 
as placed in DataHoldReaister 168, contains a ScheduleTime 
5 entry, and RealTimeCnt > ScheduleTime . (If RealTimeCnt < 
ScheduleTime , the adaptor waits a time period specified by 
dnPoll [6:0] and ref etches ScheduleTime . ) ; or (3) both 
LoadTimeCnt and ScheduleTime Valid in the packet data 
structure are zero (indicating there is no meaningful data in 
10 ScheduleTime, and that the packet is part of a temporal set 
whose start time is determined by an earlier packet in the 
set. 

FIG. 9 shows the manner in which the dnPollCnt 
counts down once every pulse of a 312 0 Khz clock from the 

i5 value loaded from DnPoll register. 

FIG. 10 shows real time count register 162 according 
to one embodiment of the invention. RealTimeCnt is a real- 
time counter which supports the packet download scheduling 
function. RealTimeCnt counts continuously, incrementing every 

20 800 ns (0.8 /us) , and wrapping to zero when it reaches its 
maximum value. When a transmit packet is scheduled for 
download, the download starts when RealTimeCnt is greater than 
or equal to the value in the packet data structure's 
ScheduleTime field. RealTimeCnt is loaded with the value in 

2 5 ScheduleTime when the loadTimeCnt bit is set. This has the 
side effect of causing the packet to be downloaded 
immediately. RealTimeCnt is cleared by reset. 

FIG. 11 shows the register format wherein the value 
in DnPoll determines the rate at which the current packet data 

30 structure is polled. Packet data structures are polled for 
two different reasons: (1) When a zero DnNextPtr is fetched 
from the current packet data structure, DnNextPtr is polled to 
determine when a new packet data structure is ready to be 
processed; (2) When packet download is delayed with 

35 ScheduleTime, the packet data structure is polled to determine 
when the RealTimeCnt has reached the required ScheduleTime. 
Polling is disabled when DnPoll is cleared. DnPoll is cleared 
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by reset. The value in DnPoll represents 320 ns time 
intervals. The maximum value represents 40.64 /xs. 

Fig. 3B illustrates an alternative embodiment of the 
invention including an interrupts based alert mechanism. In 
this embodiment, scheduler 13 0 programs an interrupt in 
interrupt timer 190. When the time for the interrupt arises, 
interrupt 19 0 generates an interrupt which is received by CPU 
process 115. CPU 115 then alerts driver 125 that an interrupt 
has occurred. Driver 125 may then initiate packet fetch 
through packet controller 192 for the next packet or group of 
packets and scheduler 13 0 may then set a next interrupt. Note 
that interrupts may be generated on a packet by packet basis, 
or interrupts may be taken for temporal sets of packets which 
are then transmitted back-to-back on the network. 

The invention has now been explained with reference 
to specific and alternative embodiments. Other embodiments 
will be obvious to those of skill in the art. The invention 
therefore should not be limited except as provided for in the 
attached claims as extended by allowable equivalents. 
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WHAT IS CLAIMED IS: 



1 1. A method for scheduling packets to be down- 

2 loaded from a host to an adaptor in order to control packet 

3 transmission rate comprising: 

4 creating a temporal set of packets that can be 

5 transmitted in a back-to-back fashion without violating a 

6 maximum bandwidth requirement of said packets 1 target 

7 destination addresses; 

8 characterizing said temporal set by a download start 

9 time and an estimated transmission end time; 

10 grouping a packet into a temporal set based on a 

11 grouping rule for said packet; and 

12 adjusting said temporal sets' download start time 

13 and estimated transmission end time according to said grouping 

14 rule. 

1 2. The method according to claim 1 wherein said 

2 grouping rule involves grouping a packet into a temporal set 

3 when the packet's download start time is greater than or equal 

4 to the set download start time and less than or equal to the 

5 set transmission end time plus the packet transmission time. 

1 3 . The method according to claim 1 wherein said 

2 grouping rule involves adjusting the set download start time 

3 by an amount determined by an overlap between the desired 

4 packet download time of said new packet and said set download 

5 start and end times. 

1 4. The method according to claim 3 wherein said 

2 grouping rule involves adjusting the set transmission end time 

3 by an amount determined by the time necessary to transmit the 

4 new packet. 

1 5. The method according to claim 2 wherein sched- 

2 uling may be represented by the formula: 

3 If Packet TxQueue Time + Packet Tx Time < Set TX End 
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4 Set TxQueue Time = Set TxQueue Time - Packet Tx 

5 Time, 

6 Else Diff Time = Packet TxQueue Time + Pacicet Time - 

7 Set Tx 5nd 

8 Set Tx End = Packet TxQueue Time + Packet Tx Time 

9 Set TxQueue Time = Set TxQueue Time - Diff Time . 

1 6. The method according to claim 1 further com- 

2 prising computing a download delay until a next temporal set. 

1 7. The method according to claim 6 further com- 

2 prising generating a programmed interrupt at the time for the 

3 next set. 

1 8. The method according to claim 4 further com- 

2 prising set gluing, whereby temporal set are combined when the 

3 start time of one temporal set approaches the end time of 

4 another temporal set. 

1 9. The method according to claim 6 further com- 

2 prising including bits in packet descriptors in system memory 

3 specifying the time at which packet download will begin in 

4 either absolute or relative terms. 

1 10. A network transmitter capable of controlling 

2 its rate of transmission to meet a target bandwidth 

3 comprising: 

4 a destination bandwidth identifier for 

5 determining a target bandwidth for a particular destination or 

6 group of destinations; 

7 a scheduler for determining when a next packet 

8 for a particular destination should be queued for transmit in 

9 order to meet said target bandwidth and for programming an 

10 alert; 

11 a programmable alert controller for causing 

12 transmission of said next packet; and 

13 an interface for receiving packets to be 

14 transmitted; 
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1 11. The device according to claim 10 further com- 

2 prising: 

3 a set builder for grouping scheduled packets 

4 into sets in order to allow said alert controller to cause an 

5 alert for a set of packets, said set of packets transmitted in 

6 back-to-back fashion. 

1 12. The device according to claim 10 wherein said 

2 alert controller comprises an interrupt controller for 

3 programming a future interrupt to initiate the future 

4 transmission of a packet. 

1 13. The device according to claim 10 wherein said 

2 alert controller comprises a packet descriptor scheduler for 

3 placing schedule time indications in packet descriptors to 

4 indicate download times for a packet. 



1 

2 



14. The device according to claim 10 wherein said 
network transmitter comprises an adaptor driver. 
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